Validating Documentation with Domain Ontologies

نویسندگان

  • Leonid Kof
  • Markus Pizka
چکیده

Do we always use the same name for the same concept? Usually not. While misunderstandings are always troublesome, they pose particularly critical problems in software projects. Requirements engineering deals intensively with reducing the number and scope of misunderstandings between software engineers and customers. Software maintenance is another important task where proper understanding of the application domain is vital. In both cases it is necessary to gain (or regain) domain knowledge from existing documents that are usually inconsistent and imprecise. This paper proposes to reduce the risk of misunderstandings by unifying the terminology of the different stakeholders with the help of an ontology. The ontology is constructed by extracting terms and relations from existing documents. Applying text mining for ontology extraction has an unbeatable advantage compared to manual ontology extraction: Text mining detects terminology inconsistencies before they are absorbed in the ontology. In addition to this, the approach presented in this paper also introduces an explicit validation of ontology gained by text mining. 1. Documents are Always Inconsistent Usually, some kind of requirements document is written in the beginning of a software project. After requirements elicitation, one of the first tasks of the software developer is to understand the requirements document which includes trying to understand the terminology used. But practical experiences show that apart from being imprecise, requirements documents also use inconsistent terminology. A simple steam boiler specification [1], written for a formal methods contest, for example, looked extremely precise at first glance. However, the document called the same measuring unit in different places “water level measurement device”, “water level measuring unit”, “device to measure the quantity of water”, . . . . Obviously, this unwanted obfuscation hampers understanding of the domain. The reader can not be sure whether there is just one unit or two or three different devices. And of course, real life specifications, not written for an academic formal methods contest, are very likely even less consistent. Furthermore, real life documents are usually much longer rendering manual detection and resolution of such inconsistencies virtually impossible. 1Correspondence to: Leonid Kof, Fakultaet fuer Informatik, Technische Universitaet Muenchen, Boltzmannstr. 3, D-85748, Garching bei Muenchen, Germany Tel.: +49 89 289-17834; Fax: +49 89 289-17307; E-mail: [email protected].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Validating Top-level and Domain Ontology Alignments using WordNet

Matching domain and top-level ontologies is an important task but still an open problem in the ontology matching field, particularly due to their different levels of abstraction. Beyond that, validating candidate alignments is crucial before exploiting them within ontology construction and integration processes involving such kinds of ontologies. This paper concerns the automatic validation of ...

متن کامل

From Software APIs to Web Service Ontologies: A Semi-automatic Extraction Method

Successful employment of semantic web services depends on the availability of high quality ontologies to describe the domains of these services. As always, building such ontologies is difficult and costly, thus hampering web service deployment. Our hypothesis is that since the functionality offered by a web service is reflected by the underlying software, domain ontologies could be built by ana...

متن کامل

The Research on Model Transformations, Based on Domain Metamodel, for Designing Requirements Specifications

ion [26]. 3 Roles of ontologies in software engineering Ontologies, for software design and development, can be used with the following objectives [29] [26]: • Specification: ontologies are used to specify either the requirements and components definitions (informal use) or the system ́s functionality. • Confidence: ontologies are used to check the system ́s design. • Reusability: ontologies coul...

متن کامل

Ontology-based Land Degradation Assessment from Satellite Images

In this paper, we introduce the idea of documenting operational chains for land degradation assessment using ontologies. We believe that this will help end-users in better understanding the land degradation characteristics and evaluate the results of the assessment process. Since the application domain is wide, various operational chains for land degradation assessment and their associated docu...

متن کامل

Definitions Management: A Semantics-Based Approach for Clinical Documentation in Healthcare Delivery

Structured Clinical Documentation is a fundamental component of the healthcare enterprise, linking both clinical (e.g., electronic health record, clinical decision support) and administrative functions (e.g., evaluation and management coding, billing). Documentation templates have proven to be an effective mechanism for implementing structured clinical documentation. The ability to create and m...

متن کامل

Learning Web Service Ontologies: an Automatic Extraction Method and its Evaluation

Successful employment of semantic Web services depends on the availability of high quality ontologies to describe the domains of these services. Building such ontologies is difficult and costly, thus hampering Web service deployment. As a solution, we describe an automatic extraction method that learns domain ontologies from textual documentations attached to Web services. We evaluate our metho...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005